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Introduction 

>  The  capacity  of  the  Gaussian  channel  without  feedback,  subject  to  a 
generalized  energy  constraint,  is  determined  in-B^-  In  that  work,  the  con¬ 
straint  is  given  in  terms  of  the  covariance  of  the  channel  noise  process.  How¬ 
ever,  there  are  many  situations  where  one  may  wish  to  determine  capacity  subject 
to  a  constraint  determined  by  a  covariance  that  is  different  from  that  of  the  channel 
noise.  An  example  is  in  jamming  or  countermeasures  situations. 

Channels  where  the  covariance  of  the  noise  is  the  same  as  that  of  the 
constraint  will  be  called  matched  channels;  otherwise,  we  say  that  the  channel 
is  mismatched  (to  the  constraint).  In  this  paper,  the  capacity  of  the  mismatched 
Gaussian  channel  is  determined  for  two  situations:  the  finite-dimensional  channel, 
and  the  infinite-dimensional  channel  with  a  dimensionality  constraint  on  the  space 
of  transmitted  signals.  Results  on  the  infinite-dimensional  mismatched  channel 
without  a  dimenionality  constraint  on  the  signal  are  given  el sewhere^[2]\  Various 
special  cases  of  the  mismatched  channel  have  been  treated  previously  [3]-  [5].* 

The  results  for  the  mismatched  channel  differ  significantly  from  those  for 
the  matched  channel.  A  discussion  of  these  differences  follows  the  proof  of  the 
main  result. 
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Definitions  and  Structure 

The  channel  is  defined  as  in  [1].  ^  and  H 2  are  real  separable  Hilbert 

spaces  with  Borel  o  fields  3-|  and  ^  anc^  inner  products  <**->-|  and  <•>•>£. 

The  message  process  X  in  is  represented  by  a  probability  ux  on 

The  message  is  encoded  into  the  transmitted  signal  A(X)  in  H2  by  a  B^/Bg-measurable 

coding  function  A.  To  each  sample  function  of  the  signal  process,  the  channel 

adds  a  sample  function  from  the  noise  process  N,  represented  by  a  Gaussian  measure 

y  on  (H0,B-J.  The  received  signal  (channel  output)  is  then  a  sample  function 
N  t 

from  the  process  Y  =  A(X)  +  N,  represented  by  the  measure  yy.  As  usual  X  and  N 
are  assumed  independent,  so  that  yy(B)  =  yx®y^{(X,Y):  A( X)  +  YeB}  where 

is  product  measure.  The  channel  probability  uXy,  which  has  marginal  measures 

ux  and  yy,  is  a  measure  on  (H-j  x  H2,  8-j  x  B2)  defined  by  yXY  (C)  = 

yx0yN  { ( X , Y ) :  (X,A(X)  +  Y)eC).  The  average  mutual  information  is  then  I[uXy], 

where  I[uXy]  =«if  it  is  false  that  yxy  is  absolutely  continuous  with  respect 

to  ux8uy  (yXy«yx0yy),  and  otherwise  I[uXy]  *  I  109  [dyJJyj  (x,y^  dwXY 

H1xH2  X  Y 

The  information  capacity  is  then  s»jp  I[yXy],  where  Q  is  a  set  of  admissible 
pairs  ( y x , A) . 

For  this  paper,  a  covariance  operator  in  a  Hilbert  space  will  be  defined  to  be 
a  symmetric  and  trace-class  bounded  linear  operator.  The  constraint  on  the 
transmitted  signal  process  A(X)  will  be  given  in  terms  of  a  covariance  operator 
Rw  in  H2;  as  is  well-known,  to  every  such  covariance  operator  there  corresponds  a 
zero-mean  Gaussian  measure  on  (Hg » 62 ) - 

When  H2  =  I-2  [0,T]  and  R  is  a  covariance  operator,  R  can  be  represented  as  an 
integral  operator  with  kernel  R  which  is  a  covariance  function.  There  is  then  a 
well-known  isomorphism  between  range  (R^/6)  ^nd  the  reproducing  kernel  Hilbert  space 
of  R.  All  measures  considered  here  will  be  assumedd  WLOG  to  have  zero  mean.  The 
capacity  will  he  determined  under  the  following  assumotions: 


(A-l)  =  Rj  (I  +  S)  R.f  ,  where  is  the  covariance  operator  of  the  noise 

measure  and  S  is  a  compact  linear  operator  that  does  not  have  -1  as  an 
eigenvalue; 

(A-2)  The  admissible  set  Q  is  the  set  of  all  (yY.  ,A) 

J1  * 

d  |  |R^2  A(x) 1 |2  dyx(x)  ^  P,  where  P  >  0  is  fixed. 

1  _ „  _l/2 

It  will  be  assumed  WLOG  [1]  that  range  (Rw)  =  H9,  so  that  Rw '  exists. 

1  N  1  N  l/?  */2 

(Rn  )*  range  (Rw  ) 

+  S)  ^2U*,  where 

U*  is  the  adjoint  of  U. 

The  class  of  all  zero-mean  Gaussian  measures  y^  with  covariance  operator  as 
in  (A-l)  includes  all  those  that  are  mutually  absolutely  continuous  with  respect 
to  yw,  where  yw  is  zero-mean  Gaussian  with  covariance  Rw[6 ] . 

From  the  results  of  [1],  one  can  limit  attention  to  cases  where  y^xj  is 
Gaussian  with  covariance  operator 


-  '/O 

Assumption  (A-l)  then  implies  that  Ru '  exists;  in  fact,  that  range 

"  1  1 

*/? 

Thus,  there  exists  a  unitary  operator  U  in  H2  such  that  R^  =  Rj  (I 


7  72 

RA(X)=  WV  (2). 

where  xn  >  0  for  n  _>  1 ,  Zn  tn  <  «,{un,  n  _>  1 }  is  a  c.o.n.  set  and  (ubv)x  =  <v,x  >  u. 

When  u/\(x)  has  (2)  ^or  covariance  and  is  Gaussian  then  [1] 


I[yXyl  =  (V2)  Zn  log  [1  +  xn]. 


(3) 


Moreover, 

£ux,|Rw'1/Zfl<S>M|  -  Trace  R^/2  RA(x)v'/2 


=  Z  tn  |  { (I  +  S)V2  U*unl  ||  .  (4) 

Defining  X2  =  xn 1 1 ( I  +  S)  ^2  U*un | J 2 *  the  capacity  problem  is  thus 

reduced  to  maximizing  {l/2  }  ,  ]og  +  ,2  (]  +  ^-1]  (5) 

over  all  sequences  (X2)  and  c.o.n.  sets  {v„,  n  >  1}  such  that  z  X2  <  P, 

n  n  —  n  n  — 

where  yn  =  <Svn«  vn  >  2’  n  -  1- 


The  supremium  of  (5)  subject  to  the  stated  constraint  is  the  capacity 

and  will  be  denoted  as  CU(P);  the  capacity  for  the  matched  channel  ( R, ,  =  RJ 

"  W  N 

will  be  denoted  by  CN(P). 


Preliminary  Results 

Lemma  1 :  Let  (yp)»  n  _<  M,  be  any  non-decreasing  sequence  of  strictly  positive 
real  numbers.  Let  (Xn)  be  any  sequence  of  M  real  numbers.  Fix  P>0,  and  define 


g(M,P,Y)  =  sup 
M  o 

{X:  I  xj;  <  P} 


n  (y  +  X^)/y  . 

'  rn  n"  Tn 


K  K 


Then  g(M,P,y)=  n  U  Y.  +  P)/(KY  ) 

n-1  K 

where  K  <  M  is  the  largest  integer  such  that  E  v.  +  P  >  Ky^.  g(M,P,Y)  is  uniquely 

1  i  -  k. 


attained  by  (X^)  such  that 


Proof:  Define  fM:  IR 


Xn  =  {*  Yi  +  P)/K  '  Yn  n  -  K 
=  0  n  >  K. 


->  IR  by  fM(yJ  =  I  log  [1  +  yp  y”1  ] • 
n=l 


We  seek  to  maximize 


fM  subject  to  the  constraints 

g(y.)  =  ?  yn  -  P  1  o 
1  n 

hi(^)  =  1  0.  -i  =  1,..,  M. 

This  is  a  constrained  optimization  problem  with  objective  function  fM  which  is  concave 

u 

over  the  convex  set  {Z  in  IR  ;  Z.  >_  0,  i  =  1,..,M).  Moreover,  each  constraint 
function  is  linear.  Thus,  any  solution  to  this  problem  will  define  a  unique 
global  maximum  for  fM  [7].  In  order  that  be  a  solution,  it  is  necessary  and 
sufficient  that  the  following  set  of  equations  be  satisfied  [7]: 


+  8-^=0  i  =  1 , . . ,  M 


yT  +y. 


M  M 

L  y*  -  P  <  0,  B[s  y?  -  P]  -  0 


(7) 


for  some  set  of  non-positive  real  numbers  {g, a ^  . .a^}. 

We  first  attempt  to  obtain  a  solution  by  setting  =  ...  =  =  0. 

This  requires  e(yi  +  y*)  =  -1  for  i  =  1 ... ,M;  thus, 

MM  ,  M  *  M 

i  y*  +  Zy.  »  -Me  ,  and  so  y*  =  (e  y.  +  Zy. )/M  -  y 
1111  n  1  1  1  1  n 

for  n  =  1,  2,..,M.  This  definition  of  y*  and  the  constraints  (8)  require  that 
M  M 

ry*+EYi>MYn 
1  1  1  1  n 

for  n  <  M;  this  inequality  is  satisfied  for  all  n  <  M  if  and  only  if  it  is  satisfied 

1  ~  M  * 

for  n  -  M.  Also,  8  =  -(y,-+  y,)  for  i  <  M  implies  3<0,  so  that  E  y  •  =  P  by 

M  1 

constraints  (7).  Thus,  if  P  +  E  y.j  >.  M y  an  optimum  solution  is  given  by 

*  M  1 

y,  =  (P  +  Eyn  -  My,)/M,  i  <  M  . 

K 

If  there  exists  K<M  such  that  KyK  <  P  +  Ey.,-  <  ( K+l )yk+i * 

1  K  _1 

then  constraints  (6 )- (8)  are  satisfied  by  choosing  3  =  -K[P  +  Ey.:]  , 

1  ' 

a-j  —  -  -*  8 , 

K  * 

sy«  -  p 

i  1 

y*  =  0,  i  >K 
i 

^  =  K  ^P  +  Eyn  -  Kyi  ],  i  <_  K 

K  '  _i  _i 

ctj  =  -K[P  +  Eyn]  +  Yj  i  *  K 


Thus, 


sup 

M  2 

(X:  EX  <  P} 


n(Yn  +  X  )/y  =  n  (E  Y  ♦  P)/(KY  ) 
n=l  n  n  n  n=l  1  1  n 
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where  K  <  M  is  the  largest  integer  such  that  £ y.  +  P  >  Ky„.  The  supremum  is 

1  i  -  K 


attained  by  y*  as  defined  above,  or  for 

o  K 


Xn  =  CP  +  ^]/K-  yn 
=  0 


n  <  K 


n  >  K  . 


Lemma  2:  Let  (a.),  1  <  i  <  K,  be  a  non-decreasing  sequence  of  strictly 

positive  real  numbers.  Suppose  that  (y  )  is  a  non-decreasing  sequence  such 
J  J  K 

that  Zy.  _>  IA.  for  all  J  <  K,  and  let  P  >  0  be  such  that  Zy.  +  P  >  Ky...  Define 

1*1'  il  In 


□ 


f^y)  =  n  (P  +  Zy. )/(Ky  ).  Then  f„  (y)  <  f,. ( A )  with  equality  if  and  only  if 
*  ~  n=l  1  1  n  K  *  ~  K  - 

y.j  =  A^  for  all  i  <  K. 

K 

Proof:  For  any  fixed  n,  3fK(y)/3yn  is  negative,  using  Zy^  +  P  _>  Ky^.  Thus  fK(y) 

K1  K 

increases  for  y„  decreasing.  One  can  now  assume  that  Zy„  -  za  .  .  To  see  this, 
n  3  1  n  1  n 


suppose  Zyn  >  ZAn-  First  assume  that  there  exists  p  _<  K  such  that  yp  >  yp_1 


and 


Yp  >  V  Define  a  sequence  (yp  )  by  yn  =  yR  if  n  t  p,  y^  =  yp  -  e. 


e  =  min  (y 


K 

,  £  (' 

1 


p  Vt  i  ui  ‘  V’  tp 


v- 


Continuing  to  form  new  sequences  in  this  manner,  one  will  eventually  obtain  a 

J  J  K  K 

non- decreasing  sequence  (yn')  with  Zy^  >_  ZAp  for  all  J  <_  K,  and  either  Zy^  =  ZAn 

I  * 

or  else  y^  =  y^  =  ...  =  yp,  where  p  is  the  largest  integer  i  such  that  y^  >  A.. . 

n  n  i 

If  the  latter  case  holds,  define  a  new  sequence  (y_),  with  y  =  y  for  2  <  n  <_  K, 

Kn  n  n 

I  it  J 

while  y*|  *  y-j  -  e,  e  =  min  (y^-Ap  z(y-  -  ) ) .  (yn)  is  non-decreasing  and  Sy“  > 

1  |y  11 

I  II  H  ■'I 

If  e  r  y-|  -  A-j ,  the  procedure  is  repeated  for  (yn)  and  yg;  if  e  =  £(y.j  -  A..),  the 

il  ii 

procedure  is  repeated  for  (y  )  and  y, .  Continuing  in  this  manner  will  eventually 

K  1  „  K 

produce  a  sequence  (y* )  such  that  z  t  =  £*  . 


1 


1 


MCi 


Assume  then  that  z  y  =  z  a  .  If  y  and  a  are  not  identical,  let  p  <  K 

1  n  1  n  '  '  P-1  p-1  P  ~P 

be  the  largest  integer  such  that  yp  t  Ap'»  since  E  yn  >  z  Ap  and  z  =  Z  An, 


y  <  A  .  Let  t  <  p  be  the  largest  integer  such  that  yt  >  At;  such  t  must  exist. 
P  P  CL 

I  I  I 

Define  a  new  sequence  (yn)  by  yn  =  yn  if  n  f  t,  n  f  p,  while  yp  =  yp  +  e, 

I 

Yt  =  Yt  "  e’  £  =  inf  (Xp  "  Yp’  Yt  "Xt^*  fK(y)  <  VY’>’  since  (vt  -e)  (yp  +  e) 

2 

=  y^Yp  -  e(yD  -  y^)  -  e  ,  and  Yp  Yf  This  procedure  is  successively  repeated; 
it  will  terminate  when  and  only  when  one  obtains  a  sequence  (y  )  such  that 

I 

Y„  =  A  for  all  n  <  K. 
n  n  — 


Main  Result 


Theorem: 

(a)  Suppose  that  Hp  has  dimension  M<®.  The  capacity  is  then 


CW(P)  )  Z  log  *  Qi  +  P  +  K 

n=1  rrnnrn 


where  0-j  <  flg  <  ..  <  ef)  are  the  eigenvalues  of  S,  and  K  is  the  largest 

K 

integer  <  M  such  that  ie.  +  P  >  K  e.,.  The  capacity  is  attained  by 

1  1  -  K 

a  Gaussian  with  covariance  operator  (2),  where  un  =  Uen  and 


•  K 

rn  =  ZQi 

n  I  1  1 


+  P  -  K9n I (1  +  en)“  K"  for  n  £  K,  in  =  0  for  n  >  K,  and 


ien»  n  >  1)  are  o.n.  eigenvectors  of  S  corresponding  to  the  eigenvalues  (en). 


No  other  Gaussian  can  attain  capacity.  The  same  result  is  obtained 

if  H,  has  dimension  L  < ®  and  y./v\  is  constrained  to  have  support  of  dimension 


(b)  Suppose  that  is  infinite-dimensional  and  that  support  (u^^)  is 
restructed  to  have  dimension  <  M  <  «.  Let  Un),  n  _>  1 ,  be  the  non¬ 
decreasing  strictly  negative  eigenvalues  of  S. 

(i)  If  {An,  n  >_  1 }  is  empty,  then  C^(P)  =  (M/2)  log  [1  +  P/M].  Capacity 

can  be  attained  if  and  only  if  S  has  zero  as  an  eigenvalue  of  multiplicity 
_>  M.  In  this  case  Cw( P)  is  attained  by  a  Gaussian  with  covariance  (2) 

where  u^  =  Ue^  and  =  P/M  for  i  £  M,  with  {e-|,..,eM}  any  o.n.  set  in 


the  null  space  of  S. 

K 

(ii)  If  KxK£EA.j  +  P  <  Kak+.|  for  some  K  £  M,  then  the  capacity  is  as  in  (a), 
with  e.  =  a.,  i  =  1,.,K,  and  can  be  similarly  attained. 

K 

(iii)  If  S  has  K  <  M  strictly  negative  eigenvalues,  K>0,  and  P  +EA.  >  Ka.,,  then 

1  i  -  K 


the  capacity  is 


CW(P)  =  (M/2)  log 


P  +  M  +  £  A. 


(  /2  )  z  log  (1  +  A  ) . 
n=l  n 


The  capacity  can  be  attained  i f  and  only  if  zero  is  an  eigenvalue  of  Swith 
multiplicity  £  M-K.  The  capacity  is  then  achieved  by  a  Gaussian  with  covariance 

K  _i  _i 

(2),  where  un  =  Uen  and  xp  =  (EA^  +  P  -  MAp ) ( 1  +  Ap)  M  for  n  £  K,  with 

K 

Sen  =  A  e  and  {e,  ,  ...e..}  an  o.n.  set;  and  with  u„  =  Uv^  and  x  =  (P  +  Ex.)M 
n  n  n  i  k  nnn  -J  i 

for  K  +  1  £  n  £  M,  where  Svn  =  0  and  {vK+j  vM>is  an  °'n’  set‘  ^he  sets 
{u^.,  uK)  and  {x-j,.,  are  uniquely  defined  for  any  maximizing  Gaussian  uA(X)- 
(c)  In  (b-ii)  and  (b-iii),  the  capacity  is  strictly  greater  than  for  the  case 


Rn  =  i-e.,  CU(P)  >C„(P).  In  (b-i),  CU(P)  =  CM(P).  In  (a),  CU(P)  >  CM(P)  if 


E9.  <  0,  or  if  P  +  E0-  <  0. 
1  1  “  1  1 


CW(P)  <  C^(P)  i f  0  ©i  <  0|^. 


Proof:  (a)  From  (5), 

i  M  2-1 

CW(P)  =  sup  ( / 2)  Z  log  [1  +  X  n  Yn  ],  where  Yn  =  1  +  <Svn>  vn  >  2>  (vn,  n  <  M) 

2 

is  a  c.o.n.  set,  and  the  supremum  is  over  all  such  c.o.n.  sets  and  all  (Xn) 

M  2 

such  that  EXn  £  P.  Since  0-j  £  62  -  •*  -  0m  are  non-decreasing  eigenvalues  of 
J  J 

S,  Z  [1  +  <  $v  ,  v  >9]  —  ,  [1  +  0  ]  for  all  J  <  M  and  any  fixed  c.o.n.  set 
1  n  n  c  1  n 

tv  ,  n  £  M}.  The  expression  of  C^(P)  in  (a),  and  the  unique  covariance  of  the 
maximizing  Gaussian  v^^y  both  now  follow  from  Lemma  1  and  Lemma  2.  The  same 
result  holds  if  dim(H2)  =  L  <  =°  and  dim  [supp(vi^X))]  1  M<L,  since  in  this  case 
S  again  has  M  smallest  eigenvalues. 


(b)  If  S  is  strictly  positive,  then  S  being  compact  implies  that  S  does  not 
have  M  smallest  eigenvalues.  However,  given  any  e>0,  one  can  find  eigenvalues 
Yp...,  y^  such  that  0  <  y^  <  £  for  i  1  M.  Using  this  in  (3)  one  obtains 

1  M 

I[yxy]  =  (  / 2)  £  log  [1  +  xn] 

1  M  9  l 

-(72)  E  log  [1  +  X*  (1  +  Yp)~  J 

1  M  2  _i 

>(72)  E  log  [1  +  X  (1  +  t) 

1  M 


The  expression  on  the  right  of  the  inequality  is  maximized,  over  all  (X*)  such  that 

M  2  2  ,  M  ] 

<  P,  by  defining  xf  =  P/M,  n  <  M.  Thus,  CjP)  >  '/2  z  log  [1  +  (1  +  e)  1  P/M] 

■j  n  n  —  w  1 


I, 


M 


for  all  €>0,  and  so  CU(P)  >  /2  z  log  [1  +  P/M].  For  the  reverse  inequality, 

1  1 

-72  2 

one  notes  that  under  the  constraint  E  llRw  4A(X)|L  <  P,  it  is  shown  in  [1] 

UX  w  1  c  2  1  2 

that  CN(P)  =  (M/2)  log  (1  +  P/M).  For  S  £  0,  |  |R“  /2  A(X)  |  |g  J I  lRy  /2|  |A(X)  ll2. 


so  that  the  solution  for  C^(P)  is  the  supremum  over  a  larger  set  than  for  Cy(P); 


i.e.  CN(P)  i  CW(P).  Thus  CW(P)  <  (M/2)  log  [1  +  P/M],  so  that  CW(P)  =  (M/2)  log  [1  + P/M] 
If  S  ^  0,  with  zero  an  eigenvalue  of  multiplicity  K,  the  above  argument  is 


modified  in  an  obvious  way  (yf  =  0  for  i  =  1,..,  min  (K,M))  to  again  obtain 
CW(P)  =  (M/2)  log  [1  +  P/M]. 


To  prove  (b-ii),  the  proof  of  (a)  is  repeated  after  substituting  A ^  for  e. , 

i  <  M. 


Now  suppose  that  S  has  K<M  strictly  negative  eigenvalues  <  ...  <  A 
K  l  -  - 

and  that  EA  .  P  >  Ka„.  C..(P)  =  sup  Cn(P.  ,v)  where 

li  "  K  W  (Pr  v)  W  1  - 


cw(Prv)  =  sup 


1  M  2  1 

2  £  log  [1  +  X;  (1  +  <Sv  ,  V  >  )-'] 

6  n=l  n  n  n  2 


v  =  (v^ ,  n  £  M)  is  any  o.n.  set,  0  £  P-j  <_  P,  and  the  supremum  is  over  all  (X^) 

K  2  M 

such  that  E  X  <  P, ,  i  X  <  P.  Repeating  the  analysis  of  (a)  and  (b-i),  one  finds  that 
1  n  i  n 

"  J 

fAi  +  PT  =  J 

i  u  i  '  '  1 

UP,  ,v)  =('/ 2  )  E  log 
w  1  n=l 


J(1  +  Ap) 


P  -  P  ~\ 

+  (]/2  )  (M-K)  log  [1  +  M  .  k1  J 

J 

where  J  £  K  is  the  largest  integer  such  that  E  A.  +  P,  >  Ja..  Since  this  result  holds 

•j  I  l  —  J 

for  any  o.n.  set  (vn,  n  <_  M} ,  it  remains  only  to  determine  the  value  of  P-|  that 

maximizes  CW(P^,  _v)  (a  differetiable  function  of  P^  in  [0,P]).  Differentiating, 

one  sees  that  Cw(P-j,  v)  is  increasing  with  P-|  so  long  as 
J  ,  J 

P-j  <[JP  -  (M-K)  ea^ ]  (M-K  +  J)  .  Since  P-j  <  Ja^  -  EA^ ,  this  inequality  is  satisfied 


if  (M-K  +  J)x,  -ex.  <  P,  which  always  holds  because  P  +  ex.  >  Kx,,  and 

J  1  1  *|  1  l\ 


\j  1  <  ^  thus  follows  that  C^(P^,  v)  is  an  increasing  function  of  P^ 

K  K 

for  P^  <  -  n.  +  Kx^.  Assumming  that  P^  _>  -  ex  +  Kx^»  the  maximum  of  Cw(P)-j ,  v  ) 

K  -1 

is  attained  uniquely  by  P-j  =  [KP  -  (M-K)  ex  ]  .  Using  the  value  of  P^  in 


the  expression  for  C^(P^,  v_),  one  obtains  C^(P)  as  in  (b-iii).  The  statement 

on  attaining  capacity  follows  from  the  results  of  (b-i)  and  (b-ii). 

To  prove  (c)  for  (b-ii)  and  (b-iii),  it  suffices  to  note  that  one  can  obtain 

a  capacity  of  CW(P)  =  (M/2)  1  og(  1  +  P/M)  by  specifying  that  {u^,  n  <_  M}  be  orthogonal 

to  the  subspace  spanned  by  (Uen,  n  <  1  },  where  {en,  n  >_  1}  are  o.n.  eigenvectors 

of  S  correpsonding  to  all  the  strictly  negative  eigenvalues  (x  );  to  see  this,  one 

n  M 

can  apply  (b-i).  The  statement  on  (a)  for  0  £  e.|  <  eM  is  clear.  If  te  <  0, 

M 

then  choosing  tn  =  P/M  for  n  £  M  gives  E-rn(l  +  e  )  £  P;  thus  the  constraint  (A-2) 
is  satisfied.  Together  with  any  choice  of  an  o.n.  set  (un,  n  £  M},  this  definition 

of  RA(X)  as  in  (2)  with  Gaussian  gives  I[uxy]  =  (M/2)log[l  +  P/M]. 


Since  the  maximizing  Gaussian  measure  is  unique,  and  is  given  as  in  (a), 

K 

CW(P)  >  (M/2)  log  [1  +  P/M]  =  CN(P).  If  P  +  ze.  £  0,  then  C^P)  >  P/2 
follows  by  log  x  ^  _>  1  -  x. 


CN(P) 


> 


Discussion 


In  the  matched  channel,  C^(P)  =  (M/2)  log  (1  +  P/M)  if  is  M-dimensional , 
or  if  is  infinite  dimensional  with  dim[support  (m^X))]  1m  [1 ,  Theorem  1]; 
moreover,  the  capacity  can  be  attained  in  each  case  by  choosing  u^(x)  *5e 

Gaussian  with  covariance  (2),  with  (u^  u^)  arv^  M  o.n.  vectors  in  H ^ 

and  with  xn  =  P/M  for  n  _<  M. 

Thus,  it  is  seen  that  the  mismatched  channel  differs  in  several  ways.  First, 

the  value  of  the  capacity  is  different  unless  S  ^  6.  Secondly,  the  problem  of 

attaining  capacity  is  much  more  significant.  Even  in  the  finite-dimensional 

channel  the  vectors  u.  uM  must  be  a  specific  set  of  vectors,  not  just  any  o.n. 

set.  If  is  infinite-dimensional  with  dim[supp(u^^j )]_<  M,  the  situation 

is  even  worse  except  for  (b-ii).  That  is,  capacity  can  be  attained  only  if  S  has 

zero  as  an  eigenvalue  of  multiplicity  >_  M  if  S  <_  0,  or  Of  multiplicity  >_  M-K  if 

K 

S  has  K  <  M  strictly  negative  eigenvalues^  <  ..  <  ^  and  P  +  EX^  £  Kak. 

Otherwise,  in  order  to  approach  capacity,  one  will  need  to  put  part  of  the  availabl 
"energy" P  in  elements (Ue^)  where  (en)  are  eigenvectors  of  S  corresponding  to 
successively  smaller  eigenvalues.  In  practical  applications,  this  usually 
corresponds  to  eigenfunctions  at  higher  and  higher  frequencies. 

Thus,  we  conclude  that  not  only  does  the  capacity  Cw( P)  have  a  different 
value  than  C^( P )  (except  when  S  _>  0),  but  attaining  or  approaching  capacity  is 
significantly  more  difficult  in  the  mismatched  channel  than  in  the  matched  channel 

It  may  be  noted  that  the  results  given  in  (a)  and  (b-ii)  of  the  Theorem  are 
similar  to  those  obtained  in  [4,  p.  170],  although  the  developments  are  quite 
different.  However,  these  previous  results  are  given  in  terms  of  a  constraint 
on  E J  J A(S) 1 1 g *  and  assume  that  the  noise  variance  components  can  be  arranged 
in  ascending  order.  This  can  only  be  done  if  the  channel  is  finite-dimensional.  I 
this  case,  one  can  take  Rw  =  I,  the  identi ty,  and  thereby  use  a  true  power  constraint 
The  assumption  (A-l )  becomes  =  I  +  S,  and  the  capacity  is  as  given  in  (a);  this 
agrees  with  the  referenced  results  in  [4]. 
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